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Abstract 


The paper is concerned with two-person games with saddle point. We investigate the 
limits of value functions for long-time-average payoff, discounted average payoff, and the 
payoff that follows a probability density. 

We provide two results. The first one is a uniform Tauber result for games: if the 
value functions for long-time-average payoff converge uniformly, then there exists the 
uniform limit for probability densities from a sufficiently broad set; moreover, these limits 
coincide. The second one is the uniform Abel result: if a uniform limit for self-similar 
densities exists, then the uniform limit for long-time average payoff also exists, and they 
coincide. 

For bounded sequences, Hardy proved that the convergence of their Cesaro means is equiv¬ 
alent to the convergence of their Abel means. This result has been generalized by Feller to the 
case of uncontrolled deterministic dynamics in continuous time; [12] to deterministic controlled 
dynamics with dependence on the initial data: if there exists a uniform limit of values for one 
of the payoffs 




(Cesaro mean), 


(Abel mean), 


then the other exists too and they are equal. 

Many papers are devoted to the subject of existence of limits of such values; let us first of 


all note [2] EUS1E1ISUI5]; see also the reviews in mm- As proved in in. for a stochastic 
two-person game with a finite number of states and actions, optimal long-time averages and 
optimal discounted averages share the common limit; for repeat games, see [6]. For differential 
games, the limits exist in special cases (first of all, in the nonexpansive-like case); [1[ 0, \ 7 \. 
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The general result that mirrors the result from [12] for dynamic games (including differential 
games) was proved in [9], 

In addition to the functions ^1[ 0 > t], Xe~ xt , one could consider arbitrary probability den¬ 
sities, and then the asymptotic behavior of the value functions for the following games. For 
discrete time systems, [14] expresses the conditions of existence of a uniform limit that is shared 
by all probability distributions. In HU, it was noted that if a probability density is nonincreas¬ 
ing, the payoff can be expressed as a convex combination of the Cesaro means. Consequently, 
that paper proved that, for discrete time systems, there exists a uniform limit for such distribu¬ 
tions and it coincides with the limit of the long-run-average if the latter exists and is uniform. 
A similar approach was used in [B] for repeat games. 

In this paper, we show uniform Tauberian theorems for dynamic two-person games with 
saddle points. Most of our assumptions restrict the dynamics of games. In particular, like 
in pT2] , we assume the close d n ess of strategies under concatenation. It is also necessary for 
the value function to satisfy Bellman’s optimality principle, even if in a weakened, asymptotic 
sense. 

We provide two results. The first one is a uniform Tauber result for games with saddle point: 
the uniform convergence of value functions for long-time average payoff implies the existence of 
a uniform limit of value functions for all probability densities from a sufficiently broad set (of 
bounded variation type); moreover, these limits coincide. The second one is the uniform Abel 
result for the games with saddle point: the existence of a uniform limit of value functions with 
self-similar probability densities implies the existence of a uniform limit of value functions for 
long-time-average payoff, and these limits coincide. 

The similar results for il[ 0 ,T], Xe~ xt were proved in [9j. Although our proofs follow [9], we 
use a weaker modification of Bellman’s optimality principle. This allows us to generalize the 
results of [12] in this article. 

1 General framework of dynamic system 

Assume the following items are given: 

• a set fl of states; 

• a set K of feasible processes, which is a subset of mappings from M>o to ; 

• a running cost g : Cl H > [0,1]; for each process z G K, assume the map t (->■ g(z(t)) is 
Borel- measurable. 

For each u G Cl , define r(a>) = {z G 1K| z(0) G Cl}, the set of all feasible processes z G IK 
starting from oo. 


Let us now define the operation of concatenation on processes. Let r G ^> q , z ', z " G IK be 
such that z'{t ) = ¥'(0). Then, their concatenation z' o T z" is defined by the following rule: 


{z'o T z")(t) = 


z'(t), t<T ; 

z"(t — r), t > t. 


Call a subset A of the set IK a playable strategy if, for every initial position u G hi, we 
have A n T(u;) ^ 0. 

Now, for each two playable strategies A ', A" and a time r G R>o , define concatenation by 
A' o T A" = {z'o T z" | ¥ G A', z" G A", z\t) = z"(0)}. 


Hereinafter set A o T i A' o T " A" = (H o T > Lb) o T » A". 

Define the following axioms for some family 21 of subsets IK: 


(p) 21 is some family of playable strategies; 

(cu) 21 allows the separation of c o (at the initial time): for each mapping £ : £2 — > 21 

3Hg21VwgD ((u)nr(w) = Anr( w ); 


(o) 21 is closed under concatenation o : Vr > 0, A', A" G 21 A! o T A" G 21. 


2 Formalization of lower and upper games 

Consider a two-player game. The first player wishes to maximize a bounded payoff function 
c : IK —> M; the second player wishes to minimize it. The first player also has his family 21 of 
playable strategies specified. 

The game is conducted in the following way: for a given w G fi, the first player demonstrates 
some set A in 21, and then the second player chooses a process z G v4nr(o;); we have a conflict: 

A G 21 ft c(z) 2 G A n r(cu). 

The value function of the lower game is 

¥ b [c](o;) = sup inf c(z) Vca G fl 

Note that, for every playable strategy A, AflT(w) ^ 0, c is bounded; therefore, the value 
for the lower game is valid. 

For the upper game, we still have two players, the first player maximizes the payoff c, 
whereas the second player minimizes it. Let the second player also have a family IB of playable 
strategies. 



The upper game is conducted in the following way. Given u G hi, let the second demonstrate 
some set B in 23; then, let the second player choose some process z G B ft T(u;). The value 
function of the upper game is 

V tt [c](o;) = inf sup c(z) Vo; G 0. 

z€Bnr(oj) 

Thanks to the boundedness of c and playable strategies in 23 , this function is valid as well. 

Remember that a game with the payoff function c has a saddle point if ¥ b [c] = V*[c]. We 
would also require a slightly weaker definition. 

Definition 1 Assign a bounded function c\ : IK —> R. to every positive X . Let us say that 
a game family with the payoff functions c\ has an asymptotic saddle point if the limit of 
|V b [cA](a;) — V^ca]^) | as A j. 0 exists, is equal to 0, and is uniform for usQ. 

We hereinafter impose the following axiom on the families 21, 23 : 

(s) An B fl r(cu) ^ 0 for any ^4 G 21, B e 6 0. 

The utility of conditions (s), (cu) is clarified by the following lemma: 

Lemma 1 Suppose 21,23 satisfy conditions (p),(oj). Assume K is equipped with a topology 
such that some bounded mapping c : K —» R is continuous. In addition, let cl And BflT(w) ^ 
0 for all oj G hi, A G 21, B G 23. Then, for all e > 0, there exist A £ G 21, B £ G 23 such that, 
for all oj GO , 

¥ b [c](a;) — e < inf c(z) < sup c(z) < ¥ a [c](a;) + e; 
nr(w) ^GB e nr(w) 

¥ b [c](w) < ¥ # [c](w). (1) 


In particular, these inequalities hold if 21, 23 satisfy conditions (p), (uj), (s) . 


Proof. Consider arbitrary w G fi,£ > 0; there exist maps f £ : — > 21, ( £ : fl —> 21 such 

that, for the payoff c under initial condition z(0) = oj, the strategy £ e (a;) is £-optimal in the 
lower game (the strategy <C e (a;) i n the upper game). Then, by the property of (cu), there exist 
strategies A £ G 21, B £ G 23 for which £ e (a;) flT(w) = TnT(w), C e ( w ) nT(w) = 5 e nT(u) for 
any oj G fl. Choose a common trajectory z £ (ou) G cl A £ fl cl B £ D T(u;). 

Since c is continuous, 


¥ b [c] (cu) — £ < 


inf c(z) 

(u)nr(Kj) 


inf c(z) < c(z £ (oj )) 
zeAe nr(w) v ' 


for all oj G fl. The converse inequality is proved for ¥* ) [c], B £ in a similar way. Now, since the 
choice of £ was arbitrary, these inequalities imply (ITT) . 

If we know that it is always A fl B flT(w) ^ 0 , then, because c would be continuous were 
IK equipped with the discrete topology, the proof is complete. □ 


3 On probability densities 


Consider a summable Borel-measurable function g in R(R>o,R>o)- Assign to it a payoff func¬ 
tion c[f?] : IK —* [0,1] by the following rule: 


<c[g](;s) = / g(t)g(z(t)) dt \/z G K. 

Jo 

Note that every c[o] is bounded by the number 1|p| |li(r> 0 ,k> 0 )- Therefore, the following defi¬ 
nitions are valid for all u ; G : 


V b [p]M = V b (u) = sup inf / g(t)g(z(t)) dt G M, 

Aea zeAnr(u) J 0 

poo 

V^](o;) = [c[^]] (tu) = inf sup / g(t)g[z(t)) dt G M. 

Bi 23 z€ />’ ! Jr, 


Recall that a Borel-measurable mapping g: M>o —> M>o is called a probability density function 
if T 

lim / git) dt = 1. 

T-s-oo J 0 

For every density g and number r G (0,1) there exists the quantile q[g](r ), i.e., the minimum 
number such that 




g(t) dt = r. 


Jo 

Naturally, in this case, “a game family with densities g\ has an asymptotic saddle point” 
means that the game family with the payoffs function c[^a] has an asymptotic saddle point. 

In what follows, for every interval [a, b) CK and function / : [a, b) — > KU {oo} , denote by 
Var b a [f] the total variation of the function / in [a, b). Consider the following definition: 


Definition 2 We say that the family of densities g\ (X > 0 ) is flat at zero if 

lim lim Var^ ex ^ [In £>a] = 0. (2) 

i—A4.0 

Definition 3 We say that the family of densities g\ (X > 0 ) is regular if, for each r G (0,1), 
the following upper limit is bounded: 


hmsup ^ [In £> A ] £ M. 

Define, for every A > 0, the probability densities w\, 7T\ by the rule 

A A . _xt 

®a — A 1 [0,1/A] , tt a — Ae 


For all r G (0,1), A > 0 , we have 

Varl^ x ^ r \hiW\] = 0, Varf nx ^ r \ In 7Ta] = — ln(l — r). 


( 3 ) 



Thus, the families (©a)a>o and (tta)a>o are flat at zero and regular. 

For every density g and parameter r G (0,1), define the function g l r ' ; R> 0 —* M> 0 by the 
rule 


g lr \t) = g(t + q[g}(r)) Vt > 0. 

Let us also define the payoff d r '[p] : IK —* M. for every A > 0, r G (0,1) by the following rule: 
for all 

a riie]{r) , x 

c |r> [p](^) = J g(t)g(z{t)) dt + V i [g ]r) ](^z(q[g}( y r))j. 

Now, for every r G (0,1), A > 0, we have 

= (1 - r)m^_, = (1 - r)7r A . 

1 — r 

Definition 4 The family of densities g\ is self-similar if, for each positive r 6 (0,1), A, there 
exists a positive h(X,r),u(X,r) such that 

Q l \ = h(X, r )p„ { A ,r). (4) 


Note that there are quite many such self-similar families. For example, for every Borel- 
measurable function / : M —y M >0 , one can define its self-similar family of densities p A (A > 0) 
by the following rule: 


Qi/rif) = -jr— -—- VT, t > 0. 

Jo /(T-r)dr 

In view of the definitions of q[p\)(r ), g^ , condition (TTIl implies 


1 — r = 




h(X, r) g u (\, r )(t)dt = h(X,r), 


Q l \ = (1 - r)g u{x ,r)- 


(5) 


Assume p A (0) = A for all A > 0. Then, for each r G (0,1), 

(1 - r)u(X,r) = h(X,r)g v (x t r)(ty = Q\{q[Q\](r)). 


Thus, 


limz/(A,r)=0 Vr G (0,1) 

A^O 


if and only if £?a(<?[£?a](V)) > 0 as A j. 0 for any r G (0,1). 

On other side, for every r G (0,1), A > 0, 


lugj((/[pj](r)) < ln®,(0) + VorS Iwl(r) [biSA]. 


( 6 ) 


Remark 1 If the self-similar family of g\ is regular, and p A (0) = A for all A > 0, then 
(17)1 holds. 



4 Main result. 


Theorem 1 Assume 21,25 satisfy conditions (p), (o), (a;), (s). 

Consider a self-similar family of densities Q\ (A > 0) such that 

1. every function Q\ satisfies £>a(0) = A; moreover, the family is flat at zero; 

2. the family satisfies ; for example, the family is regular; 

3. for sufficiently small r , the family of games with payoffs <c^[£>a] (A > 0) has an asymp¬ 
totic saddle point; 

4- a game family with the probability densities Q\ (A > 0) has an asymptotic saddle point, 
and there exists a common limit V* for V^Oa] and V^a] that is uniform in u £ . 

Then, the game family with probability densities = /xl[ 0 ,i/ M ] ( p > 0 ) has an asymptotic 
saddle point, and the common limit of values V : [g 7 m ] and V ^ [zufl also exists, is uniform in 
u ; 6 n, and coincides with V*. 

Theorem 2 Assume 21,25 satisfy axioms (p), (o), (a;), (s). 

Let a regular family of densities ^ (p > 0) satisfy 

lim [ qJt) dt = 0 VT > 0. (7) 

J Q 

For the family with probability densities w\ = A1[o,i/a] (A > 0) assume: 

1. under sufficiently small r, the game family with the payoffs (A > 0) has an 

asymptotic saddle point; 

2. a game family with probability densities w\ has an asymptotic saddle point, and there 
exists a common limit for V b [wfl and V ** [wfl that is uniform in co £ Ft. 

Then, a game family with probability densities ^ has an asymptotic saddle point, the com¬ 
mon limit of payoffs and also exists, is uniform in uj E Ft, and coincides with V*. 

Corollary 1 Assume 21,® satisfy conditions (p), (cu), (o), (s). 

Let a game family with payoffs c^^a] (A > 0) have an asymptotic saddle point for every 
family of densities and all r £ (0,1) . 

Then, the following conditions on the function V* : 12 —* [0,1] are equivalent: 

• for a certain regular self-similar family of densities Q\ with £>a(0) = A that is flat at 
zero and satisfies CD, there exists the common limit 

limV # [£ A ]M = lmiV^AjM = Vflu); 

that is uniform for 00 £ 12; 


( 8 ) 


• for a family of densities Q\ = A1 [ 0 ,i/a] , there exists the common limit of (T8~j) that is 
uniform for ui E O; 

• for a family of densities Q\ = Xe~ xt , there exists the common limit of (j~8~j) that is 
uniform for ui E O; 

• for every regular family of densities Q\ (A > 0) with (JTJ , there exists the common limit 
of © that is uniform for ca E fl. 

Note that the existence of an asymptotic saddle point for a family of games with the pay¬ 
offs c^[£>a] (A > 0) can be demonstrated in different ways. For example, under conditions 
(p), (u>), (o), (s), it is sufficient to prove that for every T > 0, any A E 21, B E 23 can be 
expressed in the form A = A o T A! , B = B o r B' for certain A! E 21, B' E 23 (see [9]). 

In some cases, not even that is necessary. 

Case of one-person games. 

Let us consider the example in the form of deterministic dynamic programming problem 
in continuous time. As in p2], assume the sets f2,K to be given; moreover, assume K to be 
closed under concatenation. 

Consider an arbitrary bounded payoff function c : IK —y R. Set 21 = {IK} . Now, 

V b [c](o;) = sup inf c(z) = inf c(z ) Vu E ft. 

Ae{K} ^6Anr(o;) *e r( w ) 

Consider all the possible selectors ( of the multivalued mapping Sl3w4 r(o;) C K; let 03 be 
the set of all possible images ((ft) of these mappings. Since K is closed under concatenation, 
condition (o) holds for 21 and 23 defined in such way. Conditions (p), (cu), (s) can be verified 
directly. Now, Lemma [1] implies that 

V^fcKo;) = sup inf c(z) = inf c(z) Vo; E fl. 

2GSnr(cj) zgr(w) 

Thus, a game with arbitrary bounded payoff c has a saddle point. Finally, 

Remark 2 For a one-person game under conditions of fWf, result of Corollary Q] holds. 

In particular, the condition of uniformity for the limit of fUTj) is in the general case indispensable 
for the results above; for the counterexample, refer to [12j . 


5 Proof of Theorems HE. 


5.1 Auxiliary statements. 

Remark 3 Under conditions of Theorem [H in view of 0,0. for all z G IK, 
[£>a] i z ) coincides with 

rq[e\](r) 

/ Qx(t)g(z{t)) dt + (1 - r)V t [g„(x,r)] (z(q[g x \(r))); 


moreover, by fljlj) , the family of games with payoffs 
rilex ](»•) 

/ Q\(t)g(z(t)) dt + ( 1 - r)V b [ft,(A, r )] (z(q[g x }(r))) 

Jo 

has an asymptotic saddle point for sufficiently small positive r. 

Lemma 2 Assume all conditions of Theorem\l\hold. Then, for all sufficiently small r G (0,1), 
there exist the limits 


1 ™V t [€ |r> [fo]](u) = lim¥ a [€ |r) [^ A ]](a;) = V*(u), 

A4-U A4-U 

and these limits are uniform for cu G fh 

Proof. By the condition, the family of games with the payoffs <c^[£> A ] has an asymptotic 
saddle point for all sufficiently small r G (0,1) . Fix one such r G (0,1). 

Let us show that, for all positive e < 1, V^c^^a]] > V* — £ for sufficiently small A > 0 . 
Suppose it is not; then, there exists a positive 5 > 0 such that, for every natural n , there exist 
a positive X n < 1/n and ui n G Ul such that V^c^^AjKk'n) < V*(uj n ) — 86. Fix that 6 < 1/8. 
But, there exists a natural n such that 

vW„,r-)] <V, + 6, V b KJ >K-& 

Fix such n, X n ,co n . 

Now, there exists A G 21 such that we obtain 

poo 

KO(0)) - 26 < V b [^ An ] (z(0)) - 6 < / g Xn (t)g(z(t))dt \/z G A; (9) 

Jo 

there exists B" G 03 such that, for all z G B ", we obtain 

V fi [^ (An ,r)](-(0 ))+6>f gu(x n ,r)(t)g(z(t))dt=-^—[ g l ? n (t)g(z{t))dt. (10) 

By definition of 5 , there exists B' G 23 such that, for all z G B' fl r(cu n ), we have 


c |r> [e»»]( z ) < V[c |r> [ex„]](w„) + 6 < V,(u„) - 7(5. 


( 11 ) 






Set T = q[p\ n ](r); since the left-hand side of this inequality depends only on z\[ 0 ,t]i the 
strategy B' can be substituted here with strategy B' o T B". 

Then, for any z E An (B' o T B") fl T(u; n ), 


f4 (cj n ) 


© 

25 < 




= / QXn{t)g(z{t))dt+ / Q^ ] n {t-T)g{z(t))dt 

Jo Jt 

coj , x in 

< <C |r> [^ A „](^) + 5 < V*{uj n )-§5. 


The obtained contradiction proves that V^c^^a]] > 14 — e for any e > 0 and sufficiently 
small A > 0 . The inequality V^c^^a]] < K + £, in view of Remark© is proved similarly. It 
remains to note that, by the condition, V # [c^[£>a]](<^) — V^c^^a]]^) -» 0 as A f 0, and this 
limit is uniform in uj E Q. □ 


5.2 Proof of Theorem |TJ 

In Section © we proved the following proposition: 

Proposition 1 Let all conditions of Theorem © hold. Then, for all positive e, there exists 
ft > 0 such that, for all positive p < ft for all uj E LI, we have ^[w^iuj) > V*(oj) — 6e. 

Define a function g~ = 1 — g. It is easy to see that, for a probability density g, relation 
g -\- g == 1 implies that, for all uj E LI, 

poo 

1 —'V^](a;)= inf sup / g(t)g~ dt. 

z&B Jo 

Conditions of Proposition © hold for the lower game with the probability density w running 
cost g~ , the limit value 1 — 14, and swapped capabilities of players 23, 21. Then, for all positive 
e for sufficiently small p > 0, we have 1 — > 1 — 14 — 6e. Now, in view of Lemma © 

we have V b [zz7 M ] < V 11 [ro M ] < 14 + 6e. As proved above, for sufficiently small positive p , we 
have 14 — 6£ < V b [Wff\. Since the positive £ was chosen arbitrarily, and 'V £ [zu^] have 

the common limit (as // —>■ 0); this limit equals 14 and is uniform in uj E LI. □ 

5.3 Proof of Theorem [2J 

Note that the family of w\ satisfies all conditions of Theorem© Then, Lemma © and Remark© 
hold for it. 

In Section © we proved the following proposition: 


Proposition 2 Let all conditions of Theorem [H hold. Then, for all positive e, there exists 
p > 0 such that, for all positive p < p for all u E LI, we have 'V b [<^ At ](a;) > V*(oj) — 6e. 

Now, a literal repetition of the proof of Theorem |T] with replaced with and Q\ 
replaced with w\ provides what was required. □ 


6 Proof of Proposition H 


According to the condition of Theorem [T] and the result of Lemma [2j for all positive x, for all 
r G (0,1), there exists a positive A(x, r ) such that, for all positive A < A(x, r ), 

K - V b [€ |r) [^ A ]] < x, V b [p A ] - K < x. 


Let us show that, for all positive e < 1/4, there exists p > 0 such that, for all positive 
p < p , we have V b > 14 — As. Fix a positive e < 1/4. 

By m, for some f G (0,1), every positive r < r has a positive A(r) such that, for a 
positive A < A(r), 

Par^ Al(r) [lnp A ] < 
now, for all nonnegative t < q[p\](r ), 


Q\{t) < e £/2 Q X ( 0) = e £/2 A < (1 + e)A, 


|p A (f) — A| dt < e\r 


( 12 ) 


as soon as A < A(r) ,r < q[g\](r),r < f for some positive f. Fix a positive f. 
There exists a natural k > 2/e > A such that ]j} A < f. Fix k . Set 


n — h -1 / fe x — - 

P-k , x- 3k ,_- 


By the condition, 



1 — p < — In p 


In k 


< r. 


(13) 


From (ED, we know that there exists a positive A (k) such that, for positive A < A (k ), we 
have A, z/( A, 1 — p) < A(x, 1 — p). Assume p = min (A (k), A(1 — pf) /k. Fix p < p. 

Set r 0 = 0 ; for each m — 1, 2 ..., k , set 




Now, in accordance with (1 13 1) , we have X m < kp < kg < X(k ); hence, X m , z/(A m , 1 — p) < 
AO, 1 -p), i.e., 


K-V b [c' 1 - p >0]]<x, 
O0( Am ,i-p)] -14 < x. 


(14) 

(15) 


Moreover, by (1 13 11 , 1 — p < f , A m . < kp < A(1 — p ) and t m = g[A m ](l — p ), we have (1 12 j) , 

i.e., 


Now, for all zeK, 


I £?A m (t) X m \ dt 5; £Xmtr, 


GG3 


(16) 


Q\m ( ( t )g{z{t)) dt < (1 + e) I X m g(z(t))dt 

a 

g(z(t)) dt ; 


= MP 


jm— 1 


(17) 


moreover, using £ < 1/4, t m = ^[pA m ](l — p) we obtain 


tmpP ^m^m(l 4“ ^) 


fTTJT ) i -|- £ r f ^ 

~ Jo 

< (l-p)(l + 3e). 


8\ m (t) dt = 


(1 ~P)(1 + g) 
1 — £ 


Thus, we have r k p < (1 — p)(l + 3£) (l +-h p k x ) < 1 + 3£, and 

O[0 ,r k \ /^l(l/Ai,(H-3e)/H - 


(18) 


By (1141) , 14 — x does not exceed the lower value 0(?A m ]] °f the game with the 

payoff cI 1-p Oa„J> be., with the payoff 

ptm 

/ Qx m (t)g(z(t)) dt + pV b [g HXm:1 _ p) ](z(t m )). 


In view of (1 15 p this, in its own turn, does not exceed the lower value of the game with the 
payoff 

/ PA m (0(^00) dt + pV*(z(t m )) + X. 

Jo 

By Lemma [H this game has a x -optimal strategy A m e 21; then, for all z G A m , 14 (z(0)) 
does not exceed 

rtm lid 

/ Qx m {t)g{z{t)) dt + pV*(z(t m )) + 3x < p m ~ l / pg{z(t)) dt + pV*(z(t m )) + 3x. 














Since the right-hand sides of these inequalities depend only on the strategy A m can 

be substituted here with any strategy that may be represented in the form A m o tm A ". Now, 
for all m — 1,..., k, A' , A” e 21, z G A' o Tml A m o Tm A" , we obtain 

PTm 

y*(z(T m _i)) <p m ~ l / pg{z(t)) dt + pV*(z(T m )) + 3x. (19) 

^ I'm-1 


Set A* = A 1 o Tl A 2 o T2 ■ ■ ■ o Tkl kL fc . For every z e A*, (1 19 1) holds for all m = 1,..., fc. 
Now, taking at hrst m — 1, and subsequently using (1 19 j) for m = 2, 3 ..., k , we have 


But then, 


K(*(0)) 


1191) 

< 


K < /z'V t, [l[o iTfc ]] + | 


m r Ti 

< / pg(z{t)) dt +pl4(z(ri)) + 3x 
Jo 

03B r 2 

< / pg(z(t)) dt +p 2 V*(z(t 2 )) + 6x 
Jo 

ffUl r T k 

< / pg(z{t)) dt + p k V*(z(r k )) + 

Jo 

n~3l i r T k 2 

< J fig(z(t))dt+~. 

as well. Now, by (1 18 p , we have 


h^ b [l[o,r fc ]] < V^w^+Se, 

K < + 3 ^ + —. 

Since by the choice of k we have ek > 2, we obtain K < V b [ci7 M ] + 4e for all p < p, which 
was to be proved. □ 


7 Proof of Proposition El 

Let us show that, for every positive e < 1, there exist ft > 0 such that, for all positive p < p 
for all w 6 O, we have 

V b [<hJM > KM - 6e. 

Fix a positive e < 1. 

For every positive p , dehne T(/r) = g[^](l — e) by J Q T ^ ^(t) dt — l — e. By (T3~fl , there 
also exist positive M, p such that 

Var^\ lnfy] dt < M 

for p < p. Fix M, p. 

Since, for all t e [0, T(//)] , we have <; /t (0) < e M <^(t), integrating over this interval, we 
obtain <^(0) < e M /T(p). Then, <^(t) < e 2M /T(p). Thanks to (jTj) , we have T(p) —» oo as 







/! I 0. Thus, for every positive A, there exists a positive /x(A) < /x, such that, for positive 
/x < /x(A), we obtain ^(t) < A for all t G [0,T(/x)]. 

There exists a natural k > M such that e M//fc < 1 + e, ke > — ln£. Then, k{l — < e. 

Fix such k. 

Set 


A 

X = 


£ 

3A-2’ 


P = e 1/k \ 



1 — p < 


£ 

k' 


( 20 ) 


Then, 5 + p5 +-b p /c “ _1 (5 — 1 — £, 5 + p5 + p 2 5 + • • • = 1. 

According to the condition of Theorem [2] and the result of Lemma [2j for a certain positive 
A(k) for all positive A < eX(k), we have 


V b [w A ] - K < x, K - V b [c^[w x ]] < x. 


( 21 ) 


Then, by definition of /x, for some positive /x < /x(eA (k)) , we also have <^(f) < £\ (k) for 
all t G [0, T(/x)] , /x < ft. Fix such /x > 0. 

Note that T(/x) = g[? M ](l —p fc2 ). Let us partition the interval [0,T(/x)] = [0, g[y M ](l — p fc2 )] 
into k 2 nonempty intervals [r m , t to+1 ) ( rn — 1 ,..., k 2 ); to this end, define 


a f Trn Su(£) dt v m -1 _ 

n r l/i m\ \ A J T m -i ' ¥ ¥ 

T 0 0, T m 9[^/xJ(l P )) '^TTi 


Tm T m _i 


Tm T m —1 


Note that the choice of \± implies that A m < £A(fc) for all m = 1 , ...,/c 2 . Then, for all 

m = 1,..., k 2 , 

A m p x - m < X m p~ m < X m p~ k2 = XJ £ < X(k), 

which implies the validity of (1211) for X m p l ~ m and X m p~ m . 

Call an interval [r', t") C R correct if Var^" [hi <, /t ] dt < M/k. 

Let us define a scalar function C on R> 0 by the following rules: C(t) = 0 for all £ > T(/x); 
C(t) = ^(i) for all t G [r m _i,T m ) if this interval is correct, and C(t) = A m for t G [r TO _i, r m ) 
if this interval is incorrect. 

Note that in this case there are at least k 2 — k correct intervals of k 2 . Then, the integrals 
of the functions and q' over the incorrect intervals do not exceed kS < £ by (1 20 II . Now, 
for all z G K, we have 

roo roo 

/ s'(t)g(z(t)) dt - q^(t)g(z(t)) dt < 2e. 

■Jo Jo 


In particular, 


V b y > vy] - 2e. 


(22) 







Note that s' was introduced such that, for all m = 1,..., /c 2 , we have 




IrZ 1 ? '( f ) _ frZl dt _ ~ P™ 

7"m 7~m —1 'I'm ^~m— 1 ^~m— 1 

A m < sup ?'(f)<e M/fc inf S(t), 

te[Tm-l,T m ) te[r m _l,T m ) 

PTm 

\ m g{z{t))dt < e M/k / g'{t)g(z{t))dt. 


(23) 


Remember that mi 1 ^ = pw v / v for all positive z/. Now, for all z£l, the density w\ mV \- 
satishes 


|i— p) _ 

^ Am pl-m ~ P^X^p-rn, 


sfaw-™ K 1 ~P) = 


1 — P 
\mP l ~ m 


^~m 7 ~m— 15 


T~m — 1 


c 11 P> [^A mP i—](-) = / A^P 1 m 0 (z(t)) dt + pV k [^\ m p-m] (z(r m - T m _i)). (24) 

Jo 

Consider any of rri — 1 ,,k 2 . By (1 21 1) , p m ~ 1 V* — x does not exceed the lower value of 
the game with the payoff p m ~ 1 <c ] ' l - p \w\ rn pi-rn \, i.e., (see (1 24 p ), the payoff 




I \ m g(z(t)) dt + p m V b [rn AmP - m ] (z(r m - r m _i)). 

Jo 

Thanks to (1 21 11 and the definition of A m , that does not exceed the value of the game with 
the payoff 

PTm J~m — 1 

/ X m g{z{t)) dt + p m V*(z(T m - T m _i)) + x. 


Then, in the last game, there exists an x-optimal strategy A m G 21, i.e., for all z G A m , the 
number p m ~ 1 V^(^z( 0)) does not exceed 


f*Trn J~m— 1 


A m g(z(t)) dt + p m V*(z(T m - r m _i)) + 3x. 

./o 

Since the right-hand side of these inequalities depends only on 2;|(o,r m -r m _i), the strategy 
A m can be substituted here with any strategy A G 21 that can be represented in the form 
A m A”. Now, for all A’ , A" G 21 ,z e A' o Tm _ x A 2 o Tm A" , 


p’"- 1 K(3(T m _ 1 )) < 


A m #(;z(f)) dt + p m K(^(T m )) +3x 


< e M/fc / g’(t)g(z(t)) dt + p m V*(z(r m )) + 3x. (25) 


Set A* = A 1 o Tl A 2 o T2 ■ ■ ■ o Tkl A k2 . For every z <E A* , (I 25 1) holds for all m — 1 ,... ,k 2 . 
Now, taking first m — 1 and accounting for To = 0, and subsequently applying (1 25 1) for 















m = 2, 3 ..., k 2 , we see that 


123 

e M/k 

< 

m 

e M/k 

< 



< 


123 

e M/k 

< 


r T fc2 


< e M/k q'(t)g(z(t)) dt + p k V*(z{r k 2 )) + 3/d 


K. 


for all z G A*. Recall that e M ^ k < 1 + e. Then, we obtain 


1201 


rT k2 


14 ( 2 ( 0 )) < e M/k <J(t)g(z(t)) dt + 2e 

Jo 

poo 

< e M/k -l + / <J(t)g(z(t))dt + 2e 

Jo 

poo 

I q'(t)g(z(t)) dt + 3e MzeA*. 


< 


Now, 

14 ( 2 ( 0 )) < V b [4](2(0)) +3e 

for all 2 G A*. In view of (I 22 1) , we proved that 14 < + 6e for all sufficiently small 

positive /i. □ 
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